arXiv:1505.06553v2 [cs.IT] 30 Oct 2015 


1 


ML Detection in Phase Noise Impaired SIMO 
Channels with Uplink Training 

Antonios Pitarokoilis, Student Member, IEEE, Emil Bjornson, Member, IEEE, 
and Erik G. Larsson, Senior Member, IEEE 


Abstract 

The problem of maximum likelihood (ML) detection in training-assisted single-input multiple-output 
(SIMO) systems with phase noise impairments is studied for two different scenarios, i.e. the case when 
the channel is deterministic and known (constant channel) and the case when the channel is stochastic 
and unknown (fading channel). Further, two different operations with respect to the phase noise sources 
are considered, namely, the case of identical phase noise sources and the case of independent phase 
noise sources over the antennas. In all scenarios the optimal detector is derived for a very general 
parameterization of the phase noise distribution. Further, a high signal-to-noise-ratio (SNR) analysis is 
performed to show that symbol-error-rate (SER) floors appear in all cases. The SER floor in the case of 
identical phase noise sources (for both constant and fading channels) is independent of the number of 
antenna elements. In contrast, the SER floor in the case of independent phase noise sources is reduced 
when increasing the number of antenna elements (for both constant and fading channels). Einally, the 
system model is extended to multiple data channel uses and it is shown that the conclusions are valid 
for these setups, as well. 
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I. Introduction 

The demand on wireless data services is expected to increase significantly over the next 
decade. Hence, next generation wireless networks must provide substantially larger data rates. 
Recently, it has been shown that massive multiple-input multiple-output {Massive MIMO) can 
provide substantial gains in spectral efficiency and radiated energy efficiency [|Il-[|3l. In Massive 
MIMO, K non-cooperative users are served by a base station (BS) with M BS antennas over 
the same time and frequency resources. When M is significantly larger than K (e.g., one order 
of magnitude) linear transmit and receive processing techniques are close to optimal and the 
minimum required radiated power can be reduced as a function of M when a fixed information 
rate is desired lf4l. If5l. 

In Massive MIMO, the BS uses estimated channel impulse responses to coherently combine the 
received uplink signals. The quality of the estimated channel state information (CSI) has direct 
impact on the performance of Massive MIMO systems. Hardware impairments further degrade 
the acquired channel knowledge. In addition, the deployment of Massive MIMO systems requires 
the use of inexpensive hardware, so that the monetary cost remains low. Such equipment is likely 
to have limited accuracy. Hence, the study of the impact of hardware impairments is of particular 
importance and relevance in Massive MIMO systems. Recently, this area has attracted significant 
research interest [0. 

An unavoidable hardware impairment in wireless communications is phase noise. Phase noise 
is introduced in communications systems during the upconversion of the baseband signal to 
passband and vice versa due to imperfections in the circuitry of local oscillators. Ideally, the 
local oscillators should produce a sinusoidal wave that is perfectly stable in terms of amplitude, 
frequency and phase. In the frequency domain that would correspond to a Dirac impulse located 
at the carrier frequency. However, the phase of the generated carrier of realizable oscillators 
typically fluctuates. This is manifested by a spectral widening around the carrier frequency 
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in the power spectral density of the local oscillator output. Phase noise can cause significant 
degradation in scenarios where it varies faster than the channel fading. This happens when the 
variance of the phase noise innovations and the coherence interval of the channel fading are 
large E Section IV.C]. Some scenarios where the phase noise degradation dominates over the 
degradation due to channel variation are fixed indoor communication and Line-of-Sight (LoS) 
communication, such as WiFi at millimeter-wave frequencies and wireless broadband-to-home 
services, respectively. Further, phase noise causes a random rotation of the information signal, 
i.e. it is a multiplicative distortion. This makes the analysis and mitigation of the phase noise 
considerably more involved in comparison to additive distortions, such as quantization noise and 
generic non-linearities. In fact, it appears that Massive MIMO systems are less robust to phase 
noise than hardware impairments modeled as additive distortions |I71. 

The problem of calculating the capacity of phase noise impaired systems is particularly 
challenging. Closed-form expressions are not available even for the simplest cases. In [O the 
author derives the first two terms of the high signal-to-noise-ratio (SNR) expansion of the 
capacity of a phase noise impaired non-fading single-input single-output (SISO) system for any 
phase noise process that is ergodic, stationary, and has finite differential entropy rate. In |0 
the first two terms of the high-SNR capacity expansion for the block memoryless phase noise 
channel are derived. In ifTOll a high-SNR capacity upper bound for the Wiener phase noise 
MIMO channel is derived. Recently, the authors in IfTTl report approximate upper and lower 
bounds on the high-SNR capacity for the multiple-input single-output (MISO) and single-input 
multiple-output (SIMO) phase noise channels and compare the cases where separate and common 
oscillators are used. Lower bounds on the sum-capacity of multi-user Massive MIMO systems 
with linear reception and phase noise impairments are recently reported in [|71, ifT^ and [[T3ll . 

The problem of data detection in non-fading channels with phase noise impairments has been 
extensively studied in the literature. In [[T4ll the optimal binary detector for partially coherent 
channels is derived. Detectors that are optimal in the high-SNR regime are derived in [[T5]| . In IfT^ 
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the problem of optimal symbol-by-symbol (SBS) detection in SISO systems is investigated when 
the carrier phase is unknown and it is shown that the computational complexity is prohibitive 
in the general case. A suboptimal but implementable algorithm is derived when the unknown 
carrier phase stays constant for a block of consecutive symbols and its performance is compared 
to the case of exact carrier phase information. In ifTTll a simulation-based phase noise model 
is used and the existence EVM floors for SISO systems is shown. The authors of IIT^ derive 
algorithms for SISO phase noise channels without fading based on factor graphs and the sum- 
product algorithm. An extension of this work for single-user MIMO systems is given in [|T^ . 
where an estimate of the channel is inserted into the likelihood function as if this estimate 
were equal to the true channel-resulting in a mismatched detector. A set of soft metrics for the 
single-user non-fading phase noise channel under various assumptions is derived in ll20ll and 
their performance is compared. In [|2T]| an algorithm for joint data detection and phase noise 
estimation is derived for single-user MIMO systems and its performance is compared to a derived 
Cramer-Rao bound. 

Even though the problem of phase noise in communication systems is extensively studied, 
there are still many open questions. In particular, the effect of phase noise on beamforming is 
not fully understood yet. In ll22ll the authors study the effect of phase noise in the error vector 
magnitude (EVM) of an antenna array. They show through analysis and measurements that the 
EVM at the direction of the main lobe is reduced when independent phase noise sources are 
used. In IfT^ achievable rates for the uplink transmission of Massive MIMO systems with time- 
reversal maximum-ratio-combining in frequency-selective channels are derived for the case of 
identical and independent phase noise sources. It is observed that the use of independent phase 
noise sources at the BS results in an increased achievable rate. This result is also supported by a 
toy example, where the actual capacity can be easily computed and shows that the capacity with 
independent phase noise sources is larger that the capacity with a single phase noise source. 
A similar result is reported in Q, where the authors show that the phase noise variance can 
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be allowed to increase logarithmically with M when independent phase noise sources are used 
without losing much in performance. This is not true in the case of identical phase noise sources. 
Finally, the authors of ifTTI observe that the phase noise number (i.e., the second term in the 
high-SNR capacity expansion) is higher in the case of independent phase noise sources. The 
effect of phase noise in linear precoders in downlink Massive MIMO systems is studied recently 
in [|^ and in In the effect of imperfect hardware and the number of oscillators 
is investigated in distributed Massive MIMO downlink systems and it is shown that superior 
performance is achieved with independent phase noise sources. 

The results in previous works are not conclusive in the sense that they involve lower or upper 
bounds on the capacity of the investigated systems. These bounds are useful as the calculation of 
the exact capacity is an exceedingly complicated problem, however, they do not provide us with 
a conclusive answer on the fundamental difference between the choice of a single or multiple 
local oscillators. Motivated by that, we rigorously derive the optimal detector in phase noise 
impaired SIMO systems with uplink training for various cases of interest. Our findings explain 
the effects noted in the prior work; that is, the fact that superior performance was observed in 
the case of independent phase noise sources. This phenomenon becomes particularly apparent 
in the high-SNR regime and for a large number of antennas. 

We consider a single-user single-input multiple-output (SU-SIMO) system with phase impair¬ 
ments at the multi-antenna receiver. The maximum likelihood (ML) detector is derived under 
two assumptions on the phase noise processes, namely, when identical {synchronous operation) 
and independent phase noise processes (non-synchronous operation) are assumed. The detectors 
are explicitly given under the assumptions of either constant or fading channels. The phase noise 
impairments are modeled so that most reasonable distributions of the phase noise increments 
can be treated in a unified framework. A high-SNR analysis is provided in all the examined 
scenarios and conclusions are drawn with respect to the symbol-error-rate (SER) performance 
of the detector when the number of receive antennas, M, increases. We observe that in the 


6 


Pilot Data 


1 1 

Fig. 1: Two slot transmission protocol. 


synchronous operation, an SER floor due to phase noise appears for both assumptions on the 
ehannel fading. The SER floor depends on the severity of the phase noise impairments and is 
independent of the number of reeeive antennas, M. An SER floor at high-SNR appears also in 
the non-synehronous eases. However, it is shown that this floor ean be made arbitrarily small 
by inereasing M. 

II. System Model 

In this paper, a single-antenna user eommunieates with a BS equipped with M antenna 
elements, whieh are impaired with phase noise. A simple transmission protoeol is eonsidered, 
whieh eonsists of two ehannel uses (see Eig. [T]). In the first ehannel use a known symbol (pilot) 
is transmitted and in the seeond an unknown information symbol is transmitted. Two different 
eases are treated with respeet to the knowledge of the wireless ehannel. Namely, in the first 
ease, termed as constant channel (CC), the ehannel is assumed deterministie and known at the 
reeeiver [l25ll . If2^ . Henee, the transmitted symbol is observed in the presenee of only additive 
noise and multiplieative phase noise. In the seeond ease, termed as fading channel (FC), the 
wireless ehannel is, additionally, assumed unknown at the reeeiver and Rayleigh fading. We start 
with the deseription of the CC for simplieity and subsequently we deseribe the extension to the 
EC. 

A. Constant Channel (CC) 

During the first ehannel use, the reeeived eomplex baseband symbol, Xm, at the m-th BS 


antenna is given by 
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Xm = + Wm, 171 = 1, . . . ,M, 


( 1 ) 


where Wm is the m-th component of the additive white Gaussian noise (AWGN) vector, w, 
distributed as a circularly symmetric complex Gaussian random vector, CAf{0, Im), dm is the 
unknown initial phase reference uniformly distributed in the interval [—tt, tt). The positive scalar 
gm is a known amplitude and p is the measured SNR at the m-th BS antenna when = 1- In 
the second channel use the received symbol at the m-th BS antenna is given by 


Vm = + Zm, m = 1,..., M, 


( 2 ) 


where Zm is the m-th component of the additive white Gaussian noise vector 2 ~ CAf{0, Im), 
s is the transmitted information symbol selected from a constellation S, such that E[s] = 0 and 
E[|sp] = 1. The real random variable, cfim, is a phase noise increment, which is independent of s, 
9m, and the AWGN. Since (j)m G [— tt , tt ), a general representation is adopted for the probability 
density function (pdf), p<s,^i<Pm), of 4>m, based on its Fourier expansion 



(3) 


where am,i, I = 0,1,2,..., are known real constants GTll . itlSll . Due to the wrapping of 
0m G [—TT, tt) the pdf is periodic and the Fourier series expansion exists. The Fourier expansion 
in (|3]) can represent any pdf in [—vr, tt) that is continuous, differentiable, unimodal, even and has 
zero mean. That includes the circular normal distribution (also known as von Mises or Tikhonov 
distribution) and the wrapped Gaussian distribution ll29l . These two models are predominantly 
used in the literature to describe phase noise of practical oscillators. For free-running oscillators 
phase noise is often modeled as a discrete-time Wiener process, where the increments are 
i.i.d. wrapped Gaussian increments |[30ll . For oscillators equipped with a PLL, the phase noise 
increment is well modeled by a random variable from a von Mises (or, equivalently, Tikhonov) 
distribution IfTTl . 

The setup presented in ([T]) and Q can model two distinct operations, with respect to phase 
noise. In the first operation, the random variables 9m and 4>m are independent across the BS 


8 


antennas {non-synchronous (NS) operation). This models a praetieal distributed antenna deploy¬ 
ment where the use of a separate oseillator per BS antenna is required. In the seeond operation, 
it holds that 9i = ■ ■ ■ = 9 m and = • • • = 0 m (synchronous (S) operation). This eorresponds 
to a praetieal eentralized deployment where the same oseillator is used for the downeonversion 
of the received passband signal to the baseband for all BS antennas. Hybrid topologies are also 
possible, however, they are less interesting to analyze since their SER is expected to lie between 
the SER of the S and NS operations. Hence, they are not considered in this work. Einally, ([U) 
and (I2l) can be expressed in vector-matrix form as 

X = y/p@g + m, 

y = y/pQ^gs + z, (4) 

where x = [xi,..., xmY, y = [vi, • • •, VmY, © = diag{e^'^0 . .., $ = diag{e^'‘^0 ..., 

g = [^fi,..., qmY is ths vector of known amplitudes and (-Y is the transpose operation. 

B. Fading Channel (FC) 

Eor the EC case, during the first channel use the signal received at the m-th BS antenna is 
given by 

\/P^m T Wmi (5) 

where hm is the m-th component of the zero mean circularly symmetric complex Gaussian 
channel vector, h ~ CAA(0, Im)- The initial phase 9^ that is present in ([T]) is absorbed in hm- 
This can be assumed without modifying the statistics of hm, due to the circular symmetry of the 
channel distribution. During the second channel use, the signal received at the m-th BS antenna 
is given by 

2/m hfa^ T ^m 


( 6 ) 




The phase noise inerement, 0^, is defined as in (|3]). The fading eoefficients hm remain eonstant 
over both ehannel uses. Further, the S and NS operations are defined similarly as in Seetion 
III-AI Finally, the veetor-matrix form of ([5]) and db]) is given by 


X = ^/ph + w, 


y = y/p^hs + z. 


(7) 


III. Optimal Detectors 


In this seetion we describe the maximum likelihood (ML) detector for each of the four different 
cases described in Section |nl The maximum a posteriori (MAP) detector is identical to the ML 
detector, up to an additive constant dependent on the priors of s, and hence can be derived by 
trivial modification of the ML detectors. The optimal detectors are summarized in Table U for 
clarity. The BS uses the received vectors x and y jointly, to derive the optimum estimate, s, of 
the transmitted information symbol, s, i.e.. 


. A 
S = 


argmax p{x^y\s 

s^S 


( 8 ) 


We start with a proposition on the likelihood function for the NS operation. 

Proposition 1: The pdf of the received vectors (a;, y) given a symbol s for the NS operation 
is given by 



(9) 


where for the CC-NS case we have 


A = exp (—lla: 


2 



2 


P(1 + kn ll^f) 


( 10 ) 




( 11 ) 


Cm = arg(|/m) - arg(xm) - arg(s) 


( 12 ) 
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and for the FC-NS case we have 


A = 


i+phr ||j.||2 i+p I 
e p+p|si^+i p+p|s|^+i 


(tt^ (p + p|sP + 1)) 


M 


f^m,l (^171,1^1 




(13) 

(14) 


+ p|sp + 1 

and C,m as in (O. h{-) is the /-th order modified Bessel function of first kind ll3T]| . (•)* is the 
complex conjugation operation and || ■ || is the Euclidean norm. 

Proof: See Appendix lA-AI for the CC-NS case and Appendix lA-BI for the FC-NS case. ■ 
Based on the result in Proposition [H the detector for the NS operation can be derived. 
Corollary 1: For a discrete constellation, S, the ML symbol, s, for the NS operation 


M / oo 

s = arg max = arg max B +y In + 2 / f3m,i cos {l(„ 

s£S s£S ^ \ ’ ^ 

m=l \ 1=1 


(15) 


where is the decision metric for the symbol s G S. For the CC-NS case we have 


B = -p|sp ||9||^ 


(16) 


(5m,i as in (fTTI) and Cm as in (fT^ . For the FC-NS case we have 

1 + P\s? 


B = —M In (l -f p -f p|sp) — 




1 + p 


\y\ 


(17) 


l + p + p\s\‘^ l + p + plsl"^ 

j3m,i as in (fT4l) and Cm as in (fT^ . 

Proof: The result follows trivially from Proposition [T] by taking the natural logarithm of 
(HU), \n{p{x,y\s)), and dropping the terms that are independent of s. ■ 

The results in Corollary [T] hold for arbitrary constellations. In the following we particularize 
Corollary [T] for the case of phase shift keying (A^-PSK) constellations. A rigorous motivation 
for this choice is deferred to Section IIV-AI 


Corollary 2: For s selected from an A^-PSK constellation (i.e., s G {e-^ jv }^^ q ^) for the NS 
operation, the ML detection rule is given by (fT5l) with B as in (fT^ for |sp = 1 


(3m,l = Oim,lh (2i/pfi'mkm|) f {‘^y/p9m\yra\) and Cm = arg(|/m) - eiTg{Xm) “ 


27rn 


N 


(18) 
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for the CC-NS case and B as in (fT71) for |sp = 1 

2p+l 


^ and Cm = arg(|/m) - arg(xm) 


I3m,l (^771,1^1 


2'Kn 


(19) 


for the FC-NS case. 

The counterparts of Proposition [T] and Corollaries [U and [2] for the S operation are provided in 
the following. 

Proposition 2: The pdf of the received vectors (cc, y) given a symbol s for the S operation 
is given by 


y\s) = A (/3o + 2 ^ /3z cos (/() 


1=1 


where for the CC-S case we have A as in (fTOl) . 


( 20 ) 


Pi = aili{2^\s* y\)Ii{2^\g^ x\), (21) 

C = arg(c/^7/) - axg{g^x) - arg(s) (22) 


and for the FC-S case we have A as in (fT3l) . 



(23) 

C = &Tg{x^y) - arg(s) 

(24) 


where is the complex conjugation and transposition operation. 

Proof: See Appendix lA-CI for the CC-S case and Appendix lA-DI for the FC-S case. 
The detector for the S operation is given by Corollary [3l 

Corollary 3: For a discrete constellation, S, the ML symbol, s, for the S operation is 

/ OO 

s = arg max = arg max i? -f In /)o -f 2 > Pi cos {l() 



(25) 


where £f is the decision metric for the symbol s G 5. For the CC-S case we have B as in (fT^ . 
Pi as in (I 2 TI) and ( as in (l22l) . For the FC-S case we have B as in (fTTl) . Pi as in (|2^ and ( as 
in dal. 
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cc 

FC 

NS 

Corollary [T] 

Corollary [T] 

([H}, ([Ull, 0. Gill 

(in, (OUl, d. CSi 

S 

Corollary [3 

Corollary 

(|25j, Oil. (mj. (in 

(d, OB- 01 


TABLE I: Summary of optimal detection rules for all the cases under investigation. 


Proof: The proof is done similarly to the proof of Corollary [T] ■ 

For an A^-PSK eonstellation the deteetor in Corollary [3] can be particularized as in Corollary HI 
Corollary 4: For s selected from an A^-PSK constellation (i.e. s G for the 

synchronous operation, the ML detection rule is given by (l25l) with B as in (fT^ for |sp = 1 

^TTT) 

I3i = aik {2^\g^x\) Ii {2,/p\g'^y\) and C = arg(t/^y) - arg(c/^ai) - (26) 

for the CC-S case and B as in (fTVl) for |sp = 1 

^ f2p\x^y\\ / ff N 27m 

A = aih ^ ^ ] and C = &Tg{x^y) - ^ (27) 

for the FC-S case. 

A. Implementation of (fTSl) and (l25l) 

At this point, a comment on computational complexity and implementation issues of the 
detectors in Corollaries [H and [3] is in order. For the S operation, the calculation of the optimal 
detectors requires the computation of inner products between vectors of size M. For the NS 
operation, the computational complexity scales also linearly with M. Hence, the complexity of 
the optimal detectors scales in all cases linearly with the number of BS antennas. From (fTSl) 
and (|251) . the ML detectors involve the calculation of infinite series, where the /-th term is a 
function of the modified Bessel functions, /«(•)• However, the implementation of these detectors 
is still feasible by appropriate truncation of the infinite series. Since the series converge very 
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I (x) 

Fie. 2: Ratio of modified Bessel functions ! and the 

O Il{x) 

bound from Lemma [T] for ^ = [2, 3, 4] as a function 
of X. 


I fee') 

Fig. 3: Ratio of modified Bessel functions and the 
bound from Lemma [T] for a set of fixed values of x as 
a function of /i. 


fast, the truncation error can be made negligible. The following lemma is useful to demonstrate 
this claim. 

Lemma 1 ( Il32]| . |[33l): For fi > u > 0 and a; > 0 it holds that 


lujx) 


> max 




(28) 


where r(-) is the Gamma function r(^) = /q°° e ^df, defined for > 0 |l34l 8.310]. 
The ratio and the bound in Lemma [Dare plotted as a function of the argument x for various 
values of fi in Fig. [2l For all /r > 1 and x it holds that < 1 and for small to moderate values 
of X we even have 1. Further, the ratio is monotonically decreasing in /i when x 

is fixed. In fact the rate of decrease is quite fast, since, for any integer /i, r(/i) = (/i — 1)! and 
x^ grows at a lesser rate than the factorial function. The rate of decrease as a function of /r for 
fixed X is shown in Fig. (3] This fast convergence establishes the fact that only a few terms of 
the infinite sums are required so that the approximation error is negligible. 
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B. The ML Detector for von Mises Phase Noise Increments 

We conclude the section by particularizing the results in a special, but important, case, where 
the optimal detector structure does not require the implementation of an infinite series. This 
involves the FC-NS and FC-S cases. 

Proposition 3: For the fading case with ~ VA^(0, k), distributed as zero mean von Mises 
random variables with concentration parameter n, the likelihood functions can be expressed for 
the NS operation as 

M + + 2k (b^5) cos (Cm)] 

p{x,y\s) = AY[ -—--^ (29) 

^=1 

and for the S operation as 

lo + (b‘^)^ + 2 k (b'S') cos (C)'j 

y|s) = A - —— -(30) 

loiK) 

where A is defined as in Cm as in dill) and C as in 

Proof: See Appendix lA-El ■ 


IV. High SNR Analysis 

A. High SNR Analysis for the Synchronous Operation 

The expressions in Propositions [B [2] and Corollaries [IM] can be easily implemented, however, 
it is hard to extract insight on the fundamental behavior of the different operations. Therefore, 
in this section we present an asymptotic analysis as p ^ oo (high-SNR) for the system models 
in dH) and d21) in order to reveal the high-SNR behavior of the detectors. We start with the 
synchronous operation as it appears to be simpler. The system model for the CC-S as p — )■ cxo 
can be expressed as 

y = lim y = xe^^s 



-0 = arg(s) + 0 


(31) 
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where x = linip^oo = ^^^9 and = arg {x^y). From (|3TI) it is apparent that the amplitude 
of s ean be deeoded error-free. Henee, distinguishing two symbols that have different amplitude 
is trivial at high-SNR. However, distinguishing two symbols that have different phase is mueh 
more ehallenging. Therefore, in the following, we restriet the study to eonstellations that eneode 
information only in the phase, that is PSK. Two-dimensional eonstellations, sueh as 16-QAM 
or 64-QAM, ean be treated as eonsisting of multiple disjoint sub-eonstellations, where the 
information is eonveyed only on the phase as explained in the following. Points of the two 
dimensional constellation that are at different distances from the origin belong to different sub¬ 
constellations with different radii and hence are easily distinguishable at high-SNR. Constellation 
points that are at the same radius belong to the same sub-constellation and their pairwise error 
probability can be calculated in a manner similar to PSK constellations. The observation '0 in 
(|3TI) is in this case sufficient statistics for the detection of arg(s). We proceed by deriving the 
asymptotic SER at high-SNR for PSK constellations. 

Proposition 4: For the CC-S case, let cj) be zero mean random variable with pdf as in 
dU), where the distribution is unimodal and symmetric around the meanly Define the error event 
e = {arg(s) 7 ^ 0|arg(s) = 0}. Then the SER floor at the high-SNR for equiprobable A^-PSK 
symbols is given by 


Pr {«} ^ 1 - I" p^(4,)d,l, = 1 - ^ ^ sill (;^) , (32) 

•^-N 1=1 

Corollary 5: From (1^ we observe that there is a non-zero SER floor for the CC-S case, 
which depends only on the statistics of the phase noise increment and the PSK constellation 
density, N, but is independent of the number of receive antennas, M. 

Remark 1: The preceding analysis is also true for FC-S by defining x = h and y = he^^s. 
Observe that the statistic corresponding to the amplitude (|3TI) is now \x^y\ = ||h,|p|s|, where 


1 


This assumption is not restrictive in practice as the widely accepted models on the phase noise impairments, such as the 


Wiener model, satisfy it. 
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h is stochastic. This generally has an impaet of the performanee in FC-S, however, in the high- 


SNR regime the effeet of the phase noise inerement dominates and the analysis for CC-S applies 


in FC-S, as well. The performance of FC-S approaehes that of CC-S as M —oo due to the 
hardening of ]g||^|p E[|/rip] almost surely ll^ . 

B. High SNR Analysis for the CC-NS case 


For the CC-NS ease based on the model in ([T]) and Q we define = limp^oo ^ and 


yrn = linip^oo By proeeeding similarly to Seetion HV-AI it ean be shown that the amplitude 


ean be decoded error free so we concentrate on the transmitted phase, whieh for the m-th BS 


antenna element is given by ijjm = axg{x*^yrn) = 4>m + arg(s). The general parameterization 


of the phase noise inerement in ([3]) does not yield mathematically tractable expressions. To 
get a mathematieally traetable expression, we eonsider that the inerements fm are independent 
VAi(0, k) random variables for m = As noted earlier, the von Mises distribution 

is used to describe phase noise resulting from oscillators that are equipped with PLL. It is 
symmetrie around its mean, unimodal and is representative of the antieipated statistieal behavior 
of phase noise. Henee, we expect that the general insights drawn from this choice will hold for 
other similar distributions. The likelihood funetion of the observed phases = [fi,... 
given the transmitted phase arg(s), ean be expressed as 


g'^Em=l cos(yi„,-arg(s)) 





and the eorresponding ML deeision rule for symbols seleeted from some alphabet S is given by 



(33) 


m=l 


Proposition 5: The deeision metric, jj^n, for the symbol n' from an A^-PSK eonstellation 
based on (l3^ is given by 



— cos 



M 


(34) 
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If we denote by e the error event, i.e., the ease where the deteeted symbol s is different from 


the transmitted symbol s, then the SER for equiprobable input symbols is given by 

( N-l 

IJ > 0} arg(s) = 0 


V n=l 


(35) 


The exaet ealeulation of the probability of error in (1351) appears formidable. We therefore 
derive an upper bound on the pairwise symbol error probability of erroneously deteeting = 
exp (j^) , n = — 1 when Sq = 1 was sent. We note that due to the symmetry of 

the von Mises distribution around its mean and the uniform priors on the input symbols, the 
eonditioning on any partieular input symbol does not affeet the result. The symbol sq = 1 with 
arg(so) = 0 is seleeted for eonvenienee. From (l34l) the is a sum of bounded independent 
and identieally distributed (i.i.d.) random variables = sin (^) sin Henee the 

following lemma ean be used. 

Lemma 2 {Bernstein Inequality |[36l): Let m = 1,..., M, be i.i.d. random variables 

with E[Xm] = 0, \Xm\ < C almost surely for some bounded eonstant C, Xg = and 

= ^VAR(Xs). Then for all f > 0 

Pr {Xg > tq} < exp 


2 +Iff 


Proposition 6: The pairwise error probability for the deteeted symbol to be Sn = exp (j^), 
n = l,...,iV — 1, given that the symbol so = 1 was sent is upper bounded by 


Pr {fin > 0|s = 1} < exp 


\ i/VAR(X„i,n) 


V 


2 + 


3 VAR(X,„,„) Io{k) 


(36) 


/ 


where C = sin (^) + sin^ (^) and 

VAR(A„,„) = (-) (-) (l - ^ 

Proof: Let Xm,n = sin (^) sin [ipn, - 7 ^) + sin^ (^) Then = 0, 

< (P = sin (^) + sin^ (^) ^ and VAR(Xm,„) is given by ([37]). Then 


(37) 


M 


Pr > 0|s = 1} = Pr < ^ sin 


m—1 


/"irn 


sin 


Tin 

~N 


)>o|=Pr|f;A„ 


> M sin ( 


/TrrtN Ii{k) \ 

\n) /o(k) j ■ 
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Define = \fM and t = \/M—The result in (1^ follows by 

applieation of Lemma [2l ■ 

Proposition shows that the pairwise error probability for the CC-NS ean be redueed expo¬ 
nentially with M. In addition, due to the union bound, the SER floor in (|T5]) ean be redueed 
arbitrarily to zero by inereasing M. 

C. High SNR Analysis for the FC-NS Case 

Similarly to RemarkfH for the FC-NS ease the following variables are defined Xm = linip^.oo ^ 
and ym = linip^oo whieh in the high-SNR regime simplify to Xm = hm and ym = 

The observation veetor v = [vi,... ,Vmi ■ ■ ■ ^vmY with = x*rJjm = is further 

defined. The derivation of the optimal deteetion rule even in this regime appears to be mathe- 
matieally intraetable. We seek to find an upper bound on the high-SNR SER floor by using a 
heuristie suboptimal deeision statistie. We assume that are i.i.d. VA^(0, n) for some k > 0 
random variables and we define the deeision veetor ^ = [(i ( 2 ]'^, where 


Cl 

A 1 


- — 

Em=l COS {fm + arg(s)) 

1- 

to 

M 



YZ.=i l^mpsin (0^-f arg(s)) 


The suboptimal deeision rule that we use is the minimum Euelidean distanee from a sealed 
N-FSK, i.e., 

s = arg min ||(^ — s|| , (39) 

s£S 

where S is the A^-PSK alphabet. Conditioned on the symbol Sq = [1 0]^ being sent and given 
the deeision rule (l39l) . an error oeeurs when e = {s 7 ^ so|s = sq}. Henee, the symbol error 
probability ean be upper bounded by the union bound as follows: 

N-l 

Pr {e} < ^ Pr {s = Sn|s = Sq} 

n=l 

where Sn = [cos(^) sin (^)]^- We provide here a lemma that is useful to bound (l40l) . 


(40) 
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Lemma 3 {Chebychev’s Inequality |[37ll ): Let X be a random variable with mean /i and 

variance For any e > 0, 2 

Pr{|X-/i| >£}<-. (41) 

Corollary 6 : The SER floor at high-SNR for the FC-NS case in (l40l) scales at least as O (^g). 
Proof: The pairwise error probabilities, Pr{s = s„|s = So}> bounded as follows 


Pr{s = Sn|s = So} = Pr{||so - Cll > ll^n - Cll|s = Sq} = Pr (sq - s„) < 0|s = Sq} 


(“) (^) VARff 

= Pr < 0} = Pr + E[e„] > E[e„]} < Pr {1^, - E[e„]| > Ei^n]} < —^ 


where ^ ^ ^ Em=i |/imP cos(0™) + 

' 27m\ \ II (a:) 


Em=i |/impsin(0m) and 


> 


E[,^„] = (1 — cos (^)) 7 ^- The inequality in (a) follows from the fact that {—+ E[,^„ 
E[^n]} C {|^n — E[^„]| > E[^„]} and (&) follows from Lemma [3l Calculation of the variance 
VAR(^„) gives 


VAR(e„) = 


(l-cos(^))^ 

M 


1 + 


h(K) 


hjn) 

^o(«) KloiK-) 


(sin(^))V 


M 




Hence a positive constant c independent of M can be found such that VAR(^„) < ci 


VARK„)=0(i). ■ 

Since the pairwise error probabilities are 0{^), from (l40l) the SER floor at the high-SNR for 
the FC-NS case is also 0{^). Hence, the SER floor for the FC-NS case can be made arbitrarily 
small as M —)■ cx). 

We conclude this section with a short intuitive explanation for the SER floors in both opera¬ 
tions. At high-SNR in the S operation every BS antenna observes exactly the same signal. As a 
result, no advantage can be gained by using multiple antennas. In contrast, in the NS operation 
each BS antenna observes the symbol perturbed by some independent phase rotation. Hence, an 
averaging effect is observed for the independent phase noise sources at the BS array. Similar 
conclusions were also drawn in El for the uplink case. 

We note that the single antenna transmitter is assumed to be phase noise free as our concern 
is the effect of the phase noise impairments at the BS array. The results are still valid in the 
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case of a phase noise impaired transmitter for the S operation with appropriate adjustment of 
the notation. However, in the NS operation the averaging effeet will not appear for the phase 
noise at the transmitter. Similar conclusions have already been drawn in prior work, sueh as 
EH, where it is shown that in the NS operation the phase noise impairments at the BS average 
out but not the ones at the user terminals. 

V. Numerical Examples 

In this seetion we present numerieal examples that verify the validity of the analytieal results 
presented in Seetions HII] and |IVl The examples are obtained by Monte Carlo simulations, where 
the reeeiver uses the deteetors from Seetion HII] to deeide on the transmitted information symbols. 
In Fig. |4]the zero mean von Mises distribution is plotted for referenee purposes for various values 
of the concentration parameter k. It is observed that the distribution is unimodal and that for 
K = 0 it eorresponds to the uniform distribution. As k inereases the distribution beeomes more 
eoneentrated around the mean. In the following we will refer to this distribution for our results, 
even though all the propositions and eorollaries in Seetion [III] hold for distributions that ean be 
parameterized as in dS]). 

In Fig. [5] the SER performanee as a funetion of the SNR p [dB] is plotted for k = 4 and M G 
{2,4, 6} for the eonstant ehannel ease. In all the numerieal examples for the CC ease is assumed 
to be g = [1,..., 1]^. In the low SNR regime array gains are observed for both operations and 
the synehronous operation is marginally better than the non-synchronous operation. However, 
the performanee of both operations in that regime prohibits reliable eommunieation. In the 
medium SNR regime (~ 0 — 10 [dB]) the non-synehronous operation has a elear advantage 
over the synehronous operation. SER floors in the high-SNR regime are observed in all eases. 
Specifically for the synehronous operation the SER performance saturates at the same value 
irrespeetive of M. This is in line with Proposition HI The dotted straight line eorresponds to the 
theoretieally ealeulated SER floor, (|3H . The theoretieal value 0.1418 is also shown. Superior 
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Fig. 4; The von Mises distribution, VA1(0, k), for 
various choices of the parameter k. 


Fig. 5: SER performance as a function of the SNR p 
for the constant channel case. The symbols are selected 
from a QPSK constellation and the concentration pa¬ 
rameter is K = 4 for various values of M. 


high-SNR performance is observed for the non-synchronous operation. In this case the SER floor 
is reduced when increasing the number of BS antennas. This verifies the results in Sections [I V-B I 
and HV^ 

In Fig. the SER performance as a function of p [dB] is plotted for k = 4 and M E {2,4, 6} 
for the fading channel case. For the synchronous operation, an array gain is observed in the 
low SNR regime and the SER floor is the same for all the values of M. These observations are 
identical to the constant channel case and are in line with Proposition IH The theoretical SER floor 
is also plotted as in Fig. [51 Analogously to the constant channel case, SER floors are observed 
in the FC-NS case as well. Further, the SER floor is reduced by increasing the number of BS 
antennas. This is also in line with the results in Sections [IV-BI and IIV-CI For the Figs. [5] and [6] the 
detectors in (fTSl) and (1251) were computed so that the relative accuracy, Sacc{^), of the truncated 
metric with z/ terms for the s constellation symbol, Cs{i^), defined as 6acc{^) = 
was less than 10“^^. In Table jll] we summarize the mean and the maximum number of terms 
required to achieve the relative accuracy, Sacc{i^), for a set of values of p [dB]. We observe that 


Cs{v)-Cs(y-l) 

Cs(u-l) 
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Fig. 6: SER performance as a function of the SNR p for 
the fading channel case. The symbols are selected from 
a QPSK constellation and the concentration parameter 
is K = 4 for various values of M. 



Fig. 7: Comparison of the SER performance as a 
function of the SNR p for the CC and EC cases. The 
symbols are selected from a QPSK constellation and the 
concentration parameter is k = 10 and for M = 5 BS 
antennas. 



2 dB 

10 dB 

22 dB 

CC 

FC 

CC 

FC 

CC 

FC 

Mean 

S 

13.8 

12.7 

16.2 

16.2 

15.7 

17.1 

NS 

10 

12.5 

13.8 

16 

15.8 

16.6 

Max 

S 

17 

16 

17 

17 

16 

18 

NS 

13 

16 

17 

17 

17 

18 


TABLE II: Mean and maximum number of terms, v, in the truncated sums of the optimal detectors from (fTSl) and 
dB) for relative accuracy of 6acc{i^) <10 M = 6 and k = 4. 


as p increases the number of terms required inereases as well. However, in all eases less than 
20 terms were suffieient for the desired relative aeeuraey. This demonstrates the faet that with 
only a few terms ineluded in (fT5l) and (l25l) . the truneation error beeomes negligible. 

In Fig. |7] the SER performanee is plotted as a funetion of the SNR p [dB] for the eonstant 
ehannel and fading ehannel eases. Similar performanee is observed for the synehronous operation 
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in both CC-S and FC-S cases. In the low SNR the eonstant ehannel exhibits better performanee. 
This is due to the randomness of the fading ehannel ease. However, as the number of BS 
antennas inereases the squared norm of the fading ehannel veetor normalized by M beeomes 
almost deterministie, so this gap is expeeted to be redueed. In the high-SNR regime the SER 
floor is the same for both the eonstant ehannel and the fading ehannel ease. This is in agreement 
with Remark [TJ The theoretieal SER floor for the CC-S and EC-S eases is also given by the 
dotted line. Eor the non-synehronous operation improved performanee is observed in the eonstant 
ehannel ease. This implies that the additional randomness due to fading in the EC-NS ease has 
a direet impaet on the SER performanee. This is an expeeted behavior. In the absenee of phase 
noise the CC-NS ease eorresponds to the deteetion in an AWGN ehannel with eonstant gain 
and the error probability ean be expressed in terms of the Q-funetion. In the absenee of phase 
noise, the EC-NS ease eorresponds to the eoherent deteetion in a Rayleigh SIMO ehannel with 
perfeet ehannel knowledge and the error probability seales only as p~^ [[38ll . This phenomenon 
naturally earries over when phase noise impairments are present. 

A. Extension to Longer Data Intervals 

In praetiee more than one ehannel uses are spent for data transmission. In this seetion we 
demonstrate that the eonelusions drawn by the study of the models in (U) and ([7]) are valid for 
transmission protoeols with multiple ehannel uses. Eor this purpose, we eonsider a setup where 
the data interval is extended to T ehannel uses. The ealeulation of the optimal deteetors in elosed 
form appears to be intraetable IfT^ . [fT^ . lfT9]| . [12T]| . Henee, the approaeh ean be summarized 
as follows. If a suboptimal traetable deteetor for the NS operation performs better than a genie- 
aided, i.e. better-than-optimal, traetable deteetor for the S operation, then the same will hold for 
the eorresponding optimal deteetors. 

We deseribe the approaeh for the EC ease as the CC ease follows in a similar fashion. Eor 
the EC-NS ease the seeond equation of ([7]) is extended to = ^/p^thst + Zt, for t = 1, ..., T, 
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where the m-th element of the diagonal matrix is and the increments 0m ['r] are 

i.i.d. distributed according to dS]). In the following we provide a suboptimal causal SBS detector 
with decision feedback for the symbol St- At time t the information symbols up to f — 1 have 
been detected and this information is used by the detector as if the detected values were the 
true ones. Further, for simplicity the detector substitutes the random rotations of the phase noise 
rotations at time 1 < r < f — 1 with their statistical mean dmH = E |^e-^ E?=i0m[r]j _ 
where we make the simplification that the phase noise statistics along the BS antennas are the 
same. Hence, the detector assumes that at some r G [1, f — 1] the received vector is observed 
via = y/pdiag ..., ['?']} hsj- + z^. Finally, at time t the phase noise impairment 

is assumed to have the same statistics as the accumulated phase noise increments up to t, 
^*^i0m[T]. Hence the causal decision feedback detector selects the st ^ S that maximizes 
the likelihood p{x, y^,... ,y^\si,..., St-i, s0. The final result follows by similar steps as in 
Proposition m i.e. 


St = arg max 

M 


-Min [at + 




^Wvtf 


f r fMXm[t]\ 
> In ao,tIo ,2 

^ ' \at + p\st\^ 


m=l 


2 ^ Oip^tlp 

p=i 


at + p\st\‘^ 

/ 2p|xm[f] | \ (p(arg(xm[f]))) 

\at + p\st\^J 


(42) 


where a* = 1 + p + pEr=l , Vm[t] = X*^ + y*^[r]dm[T]Sr, Xm[t] = V*m,ty*m[tW 

The sequence ao,t, ai,t,... is the Fourier expansion of the pdf of Y1t=i 4>m[x]. As noted, this 
detector is suboptimal but implementable. Hence, the optimal causal detector for the FC-NS case 
must perform at least as good as this suboptimal detector. 

In the following, we provide a genie-aided detector for the FC-S case. In this case, at time t 
the received vector is given by y^ = Assume that at time t, the receiver is 

aware of 7)0] = X]t=\ *^he true si,..., st-i. The causal ML detector selects the symbol 

St £ S that maximizes the likelihood p{x, y^,..., y^\sl,..., St, 'd[t]), i.e. 
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Fig. 8: Comparison of the detectors (l43l l and (l42l i for M = 20 and T = 20. 


St = argmax —Min (a* + p|stp) + 


p{\\vt-if + \ 

Si 


at + p\st\ 

2 


(43) 


In aolo 




o-t + p\st\ 


2 o.pIp 
p=i 


2p|Xt 


cos(p(t9[f] - arg(xi))) , 


^ \at + p|sip^ 

where a* = 1 + p + pYf-TJi krp, Vt-i = x + Xlt=i and Xt = StV^_iyt. This 

deteetor performs better than the aetual eausal ML deteetor for the synehronous operation, sinee 
it has the additional knowledge of the true prior symbols and the evolution of the phase noise 
proeess up to f — 1. In Fig. [8] the SER of the two deteetors in (l43l) and (l42l) is plotted for 
data interval length T = 20 and QPSK symbols. The phase noise increments (prn[t] are assumed 
to be i.i.d. zero mean wrapped Gaussian with variance = 0.07. This corresponds to the 
practical scenario of free-running oscillators. In this case, the pdf given by ([3]), where am,p = 
exp y — 11/811 . It is clear that the suboptimal FC-NS detector performs better than the 

’better-than-optimal’ FC-S detector, which establishes the claim that the results of the previous 
sections are valid in more complex setups. 


VI. Conclusions 

The problem of MF detection in a training-assisted single-user SIMO channel with phase noise 
impairments and M BS antennas was studied. A simple transmission protocol was considered. 
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where one time slot is used for pilot and the seeond time slot is used for data transmission. The 
study ineluded the ease of eonstant and fading ehannel assumptions. For both assumptions on 
the ehannel gains two operations were investigated, i.e., the synchronous and non-synchronous 
operations. Closed-form expressions of the optimal detectors were given for a general param¬ 
eterization of the phase noise increments. SER floors were observed for all cases under study. 
For the synchronous operation the SER floors were independent of M for both the constant and 
fading channel case. In contrast the SER floor in the non-synchronous operation could be made 
arbitrarily small by increasing M. 

The effects discovered in the paper are fundamental because they assume optimal detection 
in the Bayesian sense, as the detector was found by analytical marginalization of all nuisance 
parameters, i.e., the phase noise in the CC case and the channel fading and the phase noise 
in the EC case. That is, no filtering algorithm, alternative or additional, can improve the SER 
performance of any of the operations, S and NS. The observed SER floors and the distinction 
between S and NS operation are, thus, fundamental and not an artifact of suboptimal receiver 
processing. Eurther, we have shown that our results remain valid if the data interval is extended 
to multiple channel uses. 

Appendix A 
Two-Slot Proofs 

A. Proof of Proposition [U/or the CC-NS case 

Define the vectors 6 = [9i,..., 6>m]^ and 4> = [0i, • • •, fnY■ The likelihood function is given 


by 


p{x,y\s)= II p{x,y\s,O,(j))p{e,(f)\s)d0d(l)= 11 p{x\e)p{y\s,0,^)p{0)p{^)dOd^ 


(6) 


M 


M 


j fm)p(^fm)d(j)^ dO^n p{XjYii ym I ) 


m=l 


n=l 


=Piym\9m,s) 


(44) 
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where (a) follows from the fact that conditioned on 0, (p and s, the vectors x and y are 
independent and that 0, 0 are independent of each other and s. The equality in (b) is a 
consequence of the independence of the components in 6, cp, w and z. The channel probability 
law p{ym\s, Qmi (pm) Can be expressed as 


1 dmi (pm) 


cos(</im+ 6 'm-arg(s*ym)) 


TT 


TT 


g-|ymp-p 3 ^|sp 


TT 


h{‘^\fpgm\s*ym\) + 2 ^ Ii{2^gra\s*yjn\) COS (/ {(pra + Om - arg (s*ym))) j , 


1=1 


(45) 


where the last step follows from the Jacobi-Anger formula 11^ Section 4.4, p. 100] 


g«cos/3 = /^(a) + 2 ^ Ii(a) cos(//3). 


(46) 


1=1 


Then, the conditional p(ymi^m,s) is 


P{,ym\^mi ^) I Pi^Pml^ 1 ^mi (pm)p(^(pm)(^(p 
J — 

g-|Pmp-pP^|s| 


TT 


®m,0'^o(2\/p5^m|'S l/m|) T 2 ^ ^ (2^/p^j,^ |S I/ml) COS (/ (^m arg (s Pm))) 


1=1 


By manipulating p{xm\dm) in the same way as in (1451) and by the orthogonality of cos and sin 
we obtain 


P(^m)l/m|'5) / P(^m I ^m)p(^m)p(l/m I ^m) 

J —TT 

p-|a;mp-|?/mp-P 9 ^(l + |sP) 


TT^ 


/5m,0 T 2 'y ^ p)m,l COS (/(^n 


i=l 


( 47 ) 


The result in (19|), (fTOl) . (fTTI) and (fT^ follows by substituting (|47]) in (l44l) . 
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B. Proof of Proposition [U/or the FC-NS case 


The likelihood function in this case is given by 


(a) 


p{x,y\s) = / p{x,y\s,(f>,h)p{h,(f>\s)dhd(f) = / p{x\h)p{y\s,(f>,h)p{h)p{(f))dhd(f) 


= / P(0) / pih)p{y\s,cl),h)p{x\h)dhdci) 


ib) 


M 




(48) 


_1 ^ -TT 


m=l 


—p(^r7i I 

where (a) follows from the fact that conditioned on s, h and (p, the received vectors x and y 
are independent and from the assumption that s, h and cp are mutually independent. Further, 
the factorization in (&) follows from the fact that the components of h and cp are independent. 
It holds that 


P{x^\hm) = 


3 y/ph-n 


3 \Xm I ^ p\hm I ^ 


TT 


TT 


■exp pdi{2^x*^h ^}), 


(49) 






TT 


TT 


exp (3? . (50) 


From 


and p{hm) = -e the conditional pdf p{xm-, 




IS written as 


p{Xm^ ym\^^ pm) / p((^m)p(|/m1 5, 0m) |^m)d/l7i 




TT'^ 


exp - (p + p|sp + 1) 


\K 


+ S*y^e k 

I p + p|sP + l 


dhr, 


For notational convenience we define Cm = ^ and p{xm,,ym\s,pm) is expressed 


as 


P(a^m) l/m I"S) 0m) 




TT"’ 


m 


'C 


(a) e 


-|3:mp-|PmP + (p+p|sP + l)|Cm| 


^ , ,n \, ,2 C?,'! g“l^™-l^“l2/™'l^ + (p+^l'^l^ + ^)k™-l^ 

g-(p+p|s|2 + l)|h„,-C„,|=^^ (J ^ 


TT-^ 


'C 


TT^ (p + p|s|2 + 1) 
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where in (a) we complete the square and (b) is the integral of a complex Gaussian function over 
C. By using these expressions we can compute 


2/m I'S) / p(^m)p(^my 


7r2(p + p|s|2 + l) 

For convenience of the notation, we define 

l+p|«l^ It |2 1+p I., |2 

^ e P+Phl^ + l'"^"*' p+p|sp+ll2^"‘l 




TT^ (p + p|s|2 + 1) 


A 2p|s*xj;,2/^ 

5 ^m,2 — 


P + p|s|2 + 1 


, and = arg(s) - aig{x*^y^). 

(52) 


By substituting dH) and (l52l) into (ISTl) 

P{Xm, ym\s) = [ ( amfi + COS {^m) ) 


1=1 


^m,l® 


m,0 


^ ^ cos (/0^) cos(<a„.+a„,,3)^0^ 


27r 


27r 


i=i 


■^m,l^m,0^Q (24^71^) 
, -^m,l 


TT 


^ ^ (^m,l I COS {l(j)m) I loi^mp') “f 2 ^ ^ ffc(^m,2) COS (/c {(j)m “f ^m,3)) I cl(/>7i 


i=l 


fc=l 


(^Imp) COS (//Im 3)^ . (53) 

Since p(xm,Pm|s) is a density function and hence it is absolutely integrable, the interchange 
of summation and integration is possible by the Fubini-Tonelli theorem ll37]l . The result in (|9l), 
(fTBl) . (fT4l) and (O follows by substituting (l52l) and (15^ into (1481) . 


= 


m,l 


(^771,2) H“ 2 ^ ^ 


1=1 


C. Proof of Proposition |2]/or the CCS case 
The likelihood function in this case is factorized similarly to (l44l) as 

p{x,y\s) = Jp{x\e)p{e) j p(y|s,6',0)p(0)d0d6>. (54) 

The integrals in (l54l) can be calculated using a similar approach as in Appendix lA-AI yielding 
(I2Q1), CQl), ^ and (1221). 
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D. Proof of Proposition |2]/or the FC-S case 


The likelihood function in this case is factorized similarly to (l48l) as 


p{x,y\s) 



p{y\s, (j), h)p{h)p{x\h)dhd(j). 


(55) 


The integrals in (1551) can be calculated using a similar approach as in Appendix lA-BI yielding 
dlOl), ^ and dal. 


E. Proof of Proposition |5] 

The proof is similar to the proof in Appendix lA-Bi We substitute the von Mises pdf p{(j)m) = 
27rio(l7 evaluate the resulting integral. This yields (l29l) . The proof for (l30l) is similar 

to the proof for (l29l) . 


Appendix B 
T-slot Detectors 

In this appendix detectors are presented for the T-data-slot extended transmission protocol. 
Due to the fact that the derivation of the optimal detectors in the maximum-likelihood (ML) 
sense in closed form appears to be intractable, suboptimal detectors for the non-synchronous (NS) 
operation and genie-aided, i.e. better-than-optimal, detectors for the synchronous (S) operation 
are derived. By simulations it is shown that the performance of the suboptimal NS detectors is 
superior to the genie-aided S detectors. This conclusively establishes the claim that the results 
of the two-slot model are valid for the extended models as well. 

A. Decision Feedback Detector: Non-Synchronous Operation 

We consider a transmission interval of T -I- 1 channel uses. Training is done during the first 
channel use and uncoded information symbols are transmitted during the subsequent T channel 
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uses. The received vector during training, x, and the received vectors during data transmission, 
y^, at time t are given by 


X = ^&g + w, (56) 

Vt = y/p&^tgst + Zt, t = (57) 


where © = diag ..., and = diag |e-^ ..., The phases 

9m, m = 1,..., M are random i.i.d. initial phases uniformly distributed in [—tt, tt). The phase 
noise increments 0m [^] are i.i.d random variables with pdf parameterized by 


P'3?m[r](0m ['?’]) 


1 



m,p 


COS (p0mH) 


(58) 


The vector g G is a constant and known vector of amplitudes and st are data symbols 
selected from a fixed constellation S. In order to derive a suboptimal detector for the symbol 
St we consider that the receiver uses the detected symbols si,..., St^i as if they were true. 
Further, the exact knowledge of the phase noise increments is not available and can be only 
estimated. For simplicity of the detector we assume that distortion due to phase noise is equal to 
the expected value of it conditioned on the matrix of the initial phases 0. Hence, the received 
vectors y^, 1 < t < t — 1 are assumed to be observed by 


y^ = y/p&DrgSr + Zr, I < T < t - 1, (59) 

where = E [$,-] = diag ..., where a!m,i,r are the coefficients of the Fourier 

expansion of the pdf of J2i=i^rn[T]- Finally, at time t the received vector is given by (l57l) . 
where the phase noise perturbation equals to the total accumulated phase noise up to time t, 
0m ['^]- The suboptimal causal decision feedback receiver is given by 

St = aTgmaxp{x,y\\s\~^, St), 


(60) 
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where we denote the sequence = {va, ■ ■ ■ ,Vh}. The detailed steps of the derivation follow. 
The likelihood p(x, St) is written as 

p(x,y{js{-\st) = j j 

II "{l/m[^]}l I"Sl y Pm\t\')p(^^m)p(^Pm\pY)dOrndprn\t\• 

The derivation proceeds with marginalizing over the accumulated phase noise Pm[t]- 

/TT 

pip^mi {ym [^] }l I Pm [f] ')pi,Pm [^] )d(^m [f], 

-TT 

where 


p{Pm[t]) = — + 2^Q!p,m,tCOS (p93m[f]) 

Note that here the coefficients of the Fourier expansion of p{pm[t]) depend on t, since the larger 


the t the larger the variance of Pm[t]- The density p{xm, {ym[T]}\\s\ St, Om, Pm[t]) is given by 

piXm, {ym[T]}\\s{-\ 9^, Pm[t]) = \yr.[r]-ppe^^rr. 


IT' 


X e 


-\ym [i] - pf’m lt]g^ St I = 


With the use of the Jacobi-Anger formula, the phase noise term Pm[t] can be marginalized out 
and the density p{xm, {ym[T]}{\s\~^ , St, 6^) is given by 

piXm {ymYW~^ St Orn) = ——er'^^‘^'d-T!r = l\yr'<-l'^\?-P9'L{b^{i] + \st\'^)+2pp\Crn[t]grn\cOs{ern+a.rg{Crn[t]grn)) 
X ^0,mM‘2^/p\Bm[t]\) + 2'^ag^^^Jq{‘^y/p\Bm[t]\) COS (g (6'^ + arg (^^[f])))^ , 

where Byn[t] = y^[t]gmSu bm[t] = 1 + Zlt=l Cm[t] = X*^ + Z]t=l 

Further marginalization with respect to 9^ can be carried out which yields the density 

/ TT 

{Z/m[^] }o I'^1 ^ ^7n^p(,^rn)^^rn 

-TT 


IT' 


t+1 


2 ^ ^ |)-^p(2-\/p|c772[t]5^772|) COS (p (s-rg 8-rg (c^)) 

P=1 
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The detector is finally given by 


M 


m=l 


St = argmax -p\st\‘^\\g\\‘^ + In ( ao^rn,tIo{‘^y/p\Bm[t]\)Io{‘2y/p\cm[t]gm\) 

st&S ^—' ' 


(61) 


2 ^ |)4(2\/p|Cm[t]5'm|) COS {p (aig {Bm[t]) - arg { Cm [ t ] gm ))) j • 

P=1 


B. ’Better-than-optimaV Detector: Synchronous Operation 

The system model in this case follows trivially from (l5^ and (1571) by setting 9^ = 0 and 
(pjn[t] = (pt, Vm e {1,..., M}. Hence, the system model at time t is given by 


X = y/pe^^g + w, (62) 

yt = +Zt, f = l,...,T, (63) 


where 6t = 6 + X]t=i "*"he likelihood for the causal ML symbol-by-symbol (SBS) detector 
for the symbol St is given by 

p{x,y\\st)= 

St 1 

The derivation of a closed-form expression of the above decision rule appears to be mathe¬ 
matically intractable. Hence, we are interested in deriving a genie-aided, ’better-than-optimal’ 
detector, that will yield an optimistic performance bound on the detector in (l64l) . Assume that 
before the detection of the symbol st, a genie provides the receiver with the exact knowledge 
of Ot- Then, the likelihood for ML causal SBS detector for st is 


p{x, y\\0o, (t)\, s*)p(6'o) JJp(0r)d6»od0i. 


(64) 


T=1 


/TT 

p(?/t|^t,0i,st)p(0i)d0t, (65) 

-TT 

where 






TT 


M 


o-\\yt\\^-p\st\‘^\\9\\ 


71 


M 


:,2yp|yfgst I cos((/)t+6»t+arg(st3/fg)) 


( 66 ) 
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From (15^ and (1^ the likelihood in (1^ is given by 

3-ll?/tlP-pktPll9lP 


p{yM,st) = 


TT 


M 


+ COS {p {Ot + arg {styfg ))) j , 

p=i 


where fip = apip {2^\y^ gst\) ■ Finally the detector is given by 


(67) 


St = argm^ -p\st? \\gf + In (/3o + 2^/3pCos [p (6>t + arg {styfg))) ) • 

p=i 


( 68 ) 


C. Decision Feedback Detector: Non-Synchronous Operation 


The system model for the fading channel case is given by 


X = ^/ph + m, 
yt = y/p^thst +Zt, f = 


(69) 

(70) 


where the difference with (15^ and (1571) is the fading channel ~ A/c(0, /m)- The initial phase 
reference © has been absorbed into h without modifying its statistics. The causal SBS decision 
feedback detector is derived also similarly to the constant channel case by assuming the following 
auxiliary model 

X = y/ph + m, 

y^ = y/pDrhSr + Zr, T = 1, . . . , f - 1 
yt = y/p^thst + zt. 

The decision rule for this detector is given by (l60l) . The density p{x, y\\s^^^, St) in this case is 
given by 


p{x,y^\s^ ,st) = 


p{x, y^\s^ , St, ^t, h)p{h)p{^t)dhd^t 


M 


m=l 
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At the first step of the derivation the ehannel hm is marginalized out whieh yields the result for 
the density p {xm, {ym[r]}\\s\~^, st, ^m[t]) 

1 


P {Xm, {ym[r]}\\s\ \st,Pm[t]) = 



■ li 


+ 2^ 


MXm[t]\ 


, ,0 , r p(l«m[i]|2 + l!/m[t]|2|si|2) 


‘2p\Xm[t]\ 

dm[l] “f Pi'StI J p—l 


COS {p {ipm[t] + arg {Xm[t]))) , 


where a^[f] = 1 + P + pX]t=l MmHSrP, Vm[t] = X*^ + Yll=iy*m[x]dm[T]Sr and Xm[t] = 
v^[t]y'^[t]st. In the following, the variable Prn[t] is marginalized out. 

p{Xm,{ym[T]}\\s{~^,St) = J P (x^, {|/m H } ^ | Si"\ S*, [f]) [t] )d(p™ [f] 

1 _u |2_y^. I, 2, 

\x^\ = l \y^[r\\ + am[t]+p\st\-^ 


7r*+^ {dm[t] + p\st\^) 

2p|Xm[f]| 


^ I Oi 0 ,m,tdo 


dm[t] + p\St\‘^ 

Finally, deteetor is given by 


+ 2 ^ ^ Oip 


p=l 




M 


M 


St = 


arg max — In [am[t]+p\st\^) + J2 


m=l 


m=l 


P{\Xm[t]\‘^ + \ym[t]\‘^\St\‘^) 
dm[t] + P\St\^ 


(71) 


M 


I T ( 2p|Xm[l]| 

^ ° (a.+ 


m=l 


+ 2 ^ ap,m,t7p cos {p (arg (x. 


p=i 


im 


D. ’Better-than-optimaV Detector: Synchronous Operation 
The system model is given by 

X = y/ph + w, 

Vt = + Zt, t = 1,... ,T. 


(72) 

(73) 


The likelihood for the eausal ML symbol-by-symbol (SBS) deteetor for the symbol s* is given 
by 


p{x,y\\st)= ^ 


p{x, y\\h, 0*, s\)p{h)W_p{(l)^)dhd(l)\. (74) 


S'T G<S, 


r=l 
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We derive a genie-aided, ’better-than-optimal’ deteetor, that will yield an optimistie performanee 
bound on the deteetor in (1741) . Assume that before the deteetion of the symbol St, a genie provides 
the reeeiver with the exaet knowledge of X]t=\ the previously transmitted symbols, 

Then, the likelihood for ML SBS deteetor for St is 


p(a;,2/i|0i ,Si ,Si) = 


—TT 

t-l 1 


p{x,y^\h,(j)^ , st)p{h)p{(j)t)dhd(j)t (75) 


The eonditional density p{x, y\\h, (p\ , s{ St) is given by 


p{x,y^\h,(j)^ ,st) = 


1 -W^-vp'^W^ 


A 


Vt- 




Define at = l+P+pEt=i krP and Vt = a3+X;t=i The density p(cc, y\\(p\ \ (pt, Si \ St) 

follows direetly 


P{x,yi\(j)i ,0t,Si ,st)= p{x,y^\h,(j)^ ,st)p{h)dh 

Jc^ 


II II2 II ,1. II2 

-||a;|| -ET=illyTll - 


p(||lit_l|P + |st|2||j/t||^) ^ cos(</)t+2;(A]^ c^r-arg(sjp|^j^j/t)) 

“t+P|atP “t+PktP 


(tt^+i {at + p\st\^)) 

By further marginalization of cpt and with the definitions 

l|2 


M 




-Ikll -EU.brl 


2 p(||Pt-lll^ + l®tl^ll2'tll^) 
“t+P|atl^ 


(7r*+i {at + p|stp)) 


M 


A, = 


Ms*tVt-iyt\ 

at + plstl"^ 


t-i 


As = - arg {s;vf_^y^) , 


T=1 

t-l 


the likelihood p{x, y\\(j)i , (pt, s\ , s*) is given by 

( CX) 

aolo (Aa) + 2 ^ Oplp (Aa) cos (pAg) 

p=i 

Finally the deteetion rule is 

p(ll^t-ill^ + |stP \\ytf) 


St = arg max 

st^S 


In aolo 


at + p\st[^ 


Min [at + p\st\'^) 


Mx[t]\ 

Oi + p|stP 


+ 2 ^ apip 


p=i 


2p|xWI 


at + p\st\ 


't-i 


- j cos j p E (pr - arg {x[t]) 


\ 7" = 1 


(76) 


( 77 ) 
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where x[t] = StV^_iyt. 

E. Numerical Examples 

In this seetion, we provide some numerieal examples to investigate the performanee of the var¬ 
ious derived deteetors. In all the examples, the phase noise increments 0m [f], m = 1,..., M, t = 
1,..., T are assumed to be i.i.d. zero mean wrapped Gaussian random variables with variance 

9 

( 2 \ p 

— . Also in all 

the Figures M = 20 and the length of the data interval is T = 20. In Fig. |9]the performance of 
the genie-aided receiver for the synchronous operation in (IMI) is compared with the suboptimal 
decision-feedback detector for the non-synchronous operation in (IMl) for the constant channel 
case. The input symbols are selected from a QPSK constellation with equal probability and the 
variance of the phase noise increments is cr^ = 0.1. In Fig. [TOlthe uncoded SER performance 
of the detectors (IMI) and (IMl) is shown as a function of p for equiprobable 8-PSK symbols. In 
Figs, [m and [l2l the uncoded SER performance of the detectors in (ItTI) and (1771) is plotted as 
a function of p. In Eig. [H] the symbols are QPSK with equal probability and = 0.07, and 
in Pig. [I2] the symbols are equiprobable 8-PSK symbols with = 0.01. In all cases shown 
in Pigs. l9lfT2l it is clear that the decision-feedback non-synchronous detector outperforms the 
corresponding genie-aided synchronous detector. This establishes the validity of the results to 
setups with more data channel uses. 
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